Not Seeing the Forest for the Trees: Size of the Minimum Spanning Trees (MSTs) Forest and Branch Significance in MST-Based Phylogenetic Analysis
Authors
Abstract
Trees, including minimum spanning trees (MSTs), are commonly used in phylogenetic studies. However, it may not be clear to the research community that a presented tree is just one hypothesis, chosen from among many possible alternatives. It is therefore important to quantify our confidence in both the trees and the branches/edges they include. In this paper, we address this problem for MSTs by introducing a new edge betweenness metric for undirected, weighted graphs. This spanning edge betweenness metric is defined as the fraction of equivalent MSTs in which a given edge is present. The metric provides a per-edge statistic similar to that of the bootstrap approach frequently used in phylogenetics to support the grouping of taxa. We provide methods for the exact computation of this metric based on the well-known Kirchhoff's matrix tree theorem. Moreover, we implement and make available a module for the PHYLOViZ software, and we evaluate the proposed metric with respect to both effectiveness and computational performance. Analysis of trees generated using multilocus sequence typing (MLST) data and the goeBURST algorithm revealed that the space of possible MSTs in real data sets is extremely large. Selection of the edge to be represented using bootstrap could lead to unreliable results, since alternative edges are present in the same fraction of equivalent MSTs. The choice of the MST to be presented results from criteria implemented in the algorithm, which must be based on biologically plausible models.
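The definition of spanning edge betweenness given in the abstract can be illustrated with a small Python sketch. This is not the PHYLOViZ module and not necessarily the authors' algorithm; it is one possible exact computation under stated assumptions. The function names (`det`, `count_spanning_trees`, `spanning_edge_betweenness`) are hypothetical, nodes are assumed to be labelled 0..n-1, and edge weights are assumed to tie exactly (for example, integer allelic distances). The idea is to process edges by increasing weight, contract the components already joined by lighter edges, and apply Kirchhoff's matrix tree theorem to the resulting multigraph of equal-weight edges: the fraction of MSTs containing an edge e is then 1 - τ(H - e)/τ(H), where H is the connected block of that multigraph containing e and τ counts spanning trees.

```python
from fractions import Fraction
from itertools import groupby


def det(matrix):
    """Exact determinant via Gaussian elimination over Fractions."""
    m = [[Fraction(x) for x in row] for row in matrix]
    n = len(m)
    result = Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if m[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            result = -result
        result *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n):
                m[r][c] -= factor * m[col][c]
    return result


def count_spanning_trees(nodes, edges):
    """Matrix tree theorem: any cofactor of the Laplacian counts spanning trees."""
    nodes = list(nodes)
    index = {v: i for i, v in enumerate(nodes)}
    n = len(nodes)
    lap = [[0] * n for _ in range(n)]
    for u, v in edges:
        if u == v:
            continue  # self-loops never belong to a spanning tree
        i, j = index[u], index[v]
        lap[i][i] += 1
        lap[j][j] += 1
        lap[i][j] -= 1
        lap[j][i] -= 1
    # Delete the first row and column, then take the determinant.
    return det([row[1:] for row in lap[1:]])


def spanning_edge_betweenness(num_nodes, weighted_edges):
    """Return, for each (u, v, weight) edge, the fraction of MSTs containing it."""
    parent = list(range(num_nodes))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    def weight(i):
        return weighted_edges[i][2]

    seb = [Fraction(0)] * len(weighted_edges)
    order = sorted(range(len(weighted_edges)), key=weight)
    for _, level in groupby(order, key=weight):
        level = list(level)
        # Endpoints mapped to the components formed by strictly lighter edges.
        contracted = {i: (find(weighted_edges[i][0]), find(weighted_edges[i][1]))
                      for i in level}
        for a, b in contracted.values():
            parent[find(a)] = find(b)
        for i in level:
            a, b = contracted[i]
            if a == b:
                continue  # endpoints already joined by lighter edges: in no MST
            root = find(a)
            block = [j for j in level if find(contracted[j][0]) == root]
            nodes = {x for j in block for x in contracted[j]}
            total = count_spanning_trees(nodes, [contracted[j] for j in block])
            without = count_spanning_trees(
                nodes, [contracted[j] for j in block if j != i])
            seb[i] = 1 - without / total
    return seb
```

For a triangle whose three edges all have weight 1, `spanning_edge_betweenness(3, [(0, 1, 1), (1, 2, 1), (0, 2, 1)])` assigns each edge the value 2/3, because each edge appears in two of the three equivalent MSTs. Exact rational arithmetic keeps the ratios free of rounding error, but the fraction-based elimination is only practical for small graphs.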
Related resources
Network Analysis on Safety Culture and Worker‘s Behaviour : A Forest of All Minimum Spanning Trees
In this paper safety culture and worker’s are considered, all together, as a complex system and statistically represented in the form of correlation network among their characteristics. We show that the current practice, based on a minimal spanning tree (MST), to filter the information contained in the network is not robust. A robust filter based on the forest of all possible MSTs is then propo...
The effect of ecophysiography on the quantitative characteristics of DBH, height, basal area, crown diameter and canopy area of trees in mountain forest communities (Case study: Oak-hornbeam community in Arasbaran forest)
Ecophysiography is the geography of the earth and the relationship between physiography and the ecosystem. Ecophysiography is a basis for planning processes to study the characteristics of terrestrial systems concerning the interactions between terrestrial physiography and living organisms. Due to the current state of ecosystems and the increase in natural disasters for ecosystem sustainability...
Relationship between Dead Trees with Soil Physico-chemical Properties and Earthworm in Mixed Broad-leaved Forest Stand (Case study: Sarcheshmeh Forest, Chaloos)
Dead tree protection has a key role in structural and biogeochemical processes in forest ecosystems. Some aspects of dead tree dynamics have been carefully studied, but the kind and decay degree of dead trees and forest soil properties have not received enough attention. The aim of this research was to study the effect of the kind and decay degree of dead trees on soil mineral properties in the...
Selecting optimal minimum spanning trees that share a topological correspondence with phylogenetic trees
Choi et al. (2011) introduced a minimum spanning tree (MST)-based method called CLGrouping, for constructing tree-structured probabilistic graphical models, a statistical framework that is commonly used for inferring phylogenetic trees. While CLGrouping works correctly if there is a unique MST, we observe an indeterminacy in the method in the case that there are multiple MSTs. In this work we r...
Species Diversity of Trees and Forest Floor Plants in Oriental beech Forest Types of Shastkalate Educational and Research Forest, Gorgan)
Trees are the most important biological elements of forest ecosystems. The variability of the tree species composition in the Oriental beech forest not only forms different forest types but also has a remarkable impact on the species diversity of forest floor plants, due to the existence of trees in the overstory layer. In this research, forest types of an Oriental beech were ide...
Journal title:
Volume 10, Issue
Pages -
Publication date: 2015